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A 5.2-kb region at the 3' terminus of the salmon pancreas disease virus (SPDV) RNA genome has been cloned and 
sequenced. The nucleotide and predicted amino acid sequences show that SPDV shares considerable organizational and 
sequence identity to members of the genus alphavirus within the family Togaviridae. The SPDV structural proteins encoded 
by the 5.2-kb region contain a number of unique features when compared to other sequenced afphaviruses. Based on 
cleavage site homologies, the predicted sizes of the SPDV envelope glycoproteins E2 (438 aa) and E1 (461 aa) are larger than 
those of other alphaviruses, while the predicted size of the alphavirus 6K protein Is 3.2 K (32 aa} in SPDV. The E2 and E1 
proteins each carry one putative N-linked glycosylation site, with the site in E1 being found at a unique position. From amino 
acid sequence comparisons of the SPDV structural region with sequenced alphaviruses overall homology is uniform, ranging 
from 32 to 33%. While nucleotide sequence analysis of the 26S RNA junction region shows that SPDV is similar to other 
alphaviruses, analysis of the 3'-nontranslated region reveals that SPDV shows divergence in this region. t> 1999 Academic Pr™. 



Introduction. Pancreas disease (PD) of farmed Atlantic 
salmon, first described in Scotland in 1984 [18), has since 
been reported throughout Western Europe and North 
America (11, 24). The disease causes major economic 
losses with up to 50% mortality in first year salmon 
smolts in Ireland (29). Postsmolt salmon affected with PD 
appear runted and characteristic histological lesions In 
the pancreas, heart, and muscle are observed. The first 
cell culture isolate of a virus, designated salmon pan- 
creas disease virus (SPDV) and later shown to be the 
causal agent of the disease {17), was made in this lab- 
oratory from diseased fish (19). On the basis of physio- 
chemical characteristics and structural morphology, 
SPDV was tentatively classified as a toga-like" virus (19). 

In this paper we report the production of cDNA clones 
specific to SPDV RNA and, on the basis of the sequence 
analysis of the 5.2-kb region at the 3' terminus of the 
SPDV genome, we have identified SPDV as an alphavi- 
rus, the first reported in fish. The alphaviruses described 
to date are arthropod-borne viruses that contain a posi- 
tive sense, single-stranded genome of approximately 12 
kb. The four nonstructural proteins (nsP1-4) that are 
responsible for replicating the virus RNA are encoded in 
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the 5'-terminal two thirds of the genome. Virus replica- 
tion involves the synthesis of a subgenomic (26S) RNA 
species of approximately 4 kb that corresponds approx- 
imately to the 3' third of the genome. This subgenomic 
RNA is translated into a polyprotein, approximately 130 
kDa In size, which is cleaved proteolytics lly into the 
capsid protein and envelope glycoproteins E2 and E1 
(25). 

The sequence analysis, reported in this paper, com- 
pares the structural protein gene region of the SPDV 
genome with those of other sequenced alphaviruses. 

Results. Cloning strategy. Sequence analyses indi- 
cated that 90% of cDNA clones prepared from RNA 
extracted from gradient-purified SPDV exhibited strong 
protein homologies to previously sequenced alphavi- 
ruses. Three of these clones, N11, N38, and N50, mapped 
to the 5.2-kb region at the 3' terminus of the consensus 
alphavirus genome and knowledge of the sequences 
contained by these clones was used to produce three 
overlapping cDNA clones, pBS1150, pBS5038, and 
pBS38R. Clone pBS38R was produced using a 3' RACE 
reaction, the success of which depended on the SPDV 
genome possessing^ polyadenylated tract at its 3' ter- 
minus (Fig. 1). Clone pBS1150 is approximately 2.4 kb 
and encodes the C-tenminus of nsP4, the capsid, E3, and 
the N-terminus of E2. Clone pBS5038 is approximately 
1.6 kb and encodes the C-terminus of E2, 6K, and the 
N-termtnus of E1 and pBS38R is approximately 1.1 kb and 
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FIG. 1. Cloning and sequencing strategy for SPDV The position where the clones Nil, N50, and N38 map to the 3' terminus of the aiphavirus 
genome is shown. Sequence Information from these clones was then used to produce pB3H50, pBS5038, and pBS3ER, Sequencing was carried out 
by primer walking from the ends of each of these clones as Indicated by the arrows 



encodes the C-terminus of El, the 3'-nontranslated re- 
gion, and the poly(A) tract (Fig. 1). 

Nucleotide Sequence of the 3' -Terminal 5.2-kb Region 
of SPDV Genome. The complete translated nucleotide 
sequence of the 3'-terminal 5.2-kb region of SPDV RNA 
is presented in Fig. 2. The genome organization of the 
virus is similar to that of other alphaviruses (Fig. 1). 
Following the stop codon of the putative RNA-dependent 
RNA polymerase, nsP4, there is an nontranslated region 
of 35 nucleotides followed by an open reading frame of 
3852 nucleotides encoding the structural proteins. A 3'- 
nontranslated region of 91 nucleotides, including the 
stop codon, was identified prior to the poly(A) tract The 
start of the SPDV26S RNA was identified by alignment to 
other alphaviruses. The 26S RNA is 3972 nucleotides 
excluding the poly(A) tract and is shorter than most 
described alphaviruses. 

Comparison of SPDV Structural Proteins with Other 
Aiphavirus Proteins. The putative cleavage sites be- 
tween the structural proteins of SPDV (Fig. 2] were 
predicted on the basis of homologies shared with 
other alphaviruses (data not shown). Cleavage at 
these sites would result in SPDV having larger El and 
E2 envelope glycoproteins than any previously re- 
ported aiphavirus and more notably a smaller 6K pro- 
tein (Table 1). The percentage of amino acid sequence 
identity was determined using pairwise comparisons 
between the structural region of SPDV and that of 
other alphaviruses and was shown to be uniform, with 
percentage identities falling between 32 and 33% (Ta- 
ble 2). The C-terminal 394 amino acids of nsP4 of 



SPDV show 59-62% sequence identity to the equiva- 
lent region in other alphaviruses (data not shown). 

Comparison of the putative glycosylation sites in the 
envelope proteins shows SPDV to be unusual in having 
no N-linked glycosylation sites in E3. In alphaviruses 
such as SFV and SIN, E3 has been shown to possess a 
conserved site at amino acids 11-14. The E2 and E1 
glycoproteins of SPDV each contain one site. The N- 
finked site within E2 is at amino acid 319, which is similar 
to that found in SIN, WEEV, and EEEV However, within E1 
of SPDV, the N-linked site is at amino acid 35. a finding 
which contrasts with that observed in other sequenced 
alphaviruses, which are glycosylated at amino acids 
139-141. 

With most alphaviruses, a major antigenic region is 
located between amino acids 170 and 220 within the E2 
glycoprotein and is responsible for virus neutralization 
(27, 12). Examination of the sequence of amino acids 
from 194 to 234 within the E2 specified by SPDV shows 
some sequence similarity to the antigenic regions of 
other alphaviruses in that It contains two cysteine resi- 
dues and one glycine residue which are invariant in this 
region among alphaviruses (26). 

Like other alphaviruses the amino acid sequence of 
the N-terminus of the capsid protein contains a relatively 
high proportion of basic residues, which are believed to 
have a role in the interaction with genomic RNA during 
encapsidation (8, 5, 22). The SPDV capsid also contains 
the C-terminus serine protease motif responsible for 
release of the capsid protein from the structural polypro- 
tein (10, 3). The SPDV E1 glycoprotein contains the con- 
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FIG. 2. The complete translated nucleotide sequence of the SPDV structural region and the C-terminus of nsP4. The nucleotide sequence is 
numbered from the 5' terminus. Amino acids are numbered from the start of each protein. This nucleotide sequence was submitted to the GenBanlc 
and has been assigned Accession No. AJ012631. 
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TABLE 1 

Size in Amino Acids of SPDV Structural Proteins Compared 
to Nine Other Alphaviruses 



Capsid E3 E2 6K E1 



SPDV 282 71 438 32 461 

SIN 264 64 423 55 439 

SFV 267 66 422 60 438 

VEE 275 59 423 55 442 

EEE 259 63 420 56 441 

WEE 259 60 423 55 439 

RRV 270 63 . 422 60 438 

ONN 260 64 423 61 439 

AURA 267 61 424 54 438 

BFV 253 68 421 58 439 



Note. Predictions were made on the basis of conserved cleavage 
sites between the structural proteins. 



served hydrophobic stretch in its N-terminus believed to 
have a role in membrane fusion (7, 25). 

Comparison of SPDV Noncoding Nucleotide Se- 
quences with Those of Other Alphaviruses. The nucleo- 
tide sequence of the 26S RNA junction region of SPDV 
was compared with other alphaviruses (Fig. 3A). This 
region, conserved among alphaviruses, contains the C- 
terminus of nsP4 and the beginning of the 26S RNA and 
has a role as promoter for the transcription of the sub- 
genomic 26S RNA (20, 15). SPDV is similar In this region 
but not identical, showing nine differences to SIN and 
five differences to EEEV and WEEV over a 24-nucleotide 
stretch. SPDV also differs at three residues which are 
conserved in 10 other alphaviruses. 

The 3'-nontranslated region of alphaviruses is be- 
lieved to play an important role in virus replication. The 
3'-nontranslated region of SPDV is one of the shortest 
reported for an alphavirus at 91 nucleotides, with Pixuna 
virus being the shortest at 77 nucleotides and Bebaru 
virus being the longest at 609 nucleotides (23). Whereas 



most alphaviruses have two or three copies of repeat 
elements in their 3'-nontranslated regions (21), SPDV 
Jacks repeat sequence elements and shows little homol- 
ogy to other alphaviruses in this region prior to the 
19-nucleotide terminal sequence. 

The last 19 nucleotides prior to the poly(A) tract are 
highly conserved among alphaviruses (21, 15). However, 
SPDV shows significant divergence in this region (Fig. 
3B), in that it contains a nine-nucleotide stretch not seen 
in any of other sequenced alphaviruses. When this in- 
sertion and other small deletions are taken into consid- 
eration, SPDV shares nucleotide identity with other al- 
phaviruses at 10-14 positions. Also, the 3'-termina! C 
residue prior to the poly(A) tract, which is believed to be 
necessary for replication of SIN virus (13), is present in 
. the SPDV genome. 

Discussion. In this paper we report the cloning, se- 
quencing, and sequence analysis of the 3'-terminal 
5.2-kb region of the genome of SPDV, a virus that infects 
and causes economically important disease in farmed 
Atlantic salmon. On the basis of the genomic organiza- 
tion of this region and protein homologies observed 
between the structural proteins of SPDV and other well- 
characterized alphaviruses, we have concluded that 
SPDV can be classified as an alphavirus, the first alpha- 
virus reported in fish. On the basis that PD of farmed 
Atlantic salmon and sleeping disease (SD) of freshwater- 
reared rainbow trout, Oncorhynchus mykiss, share simi- 
lar histopathology and that acquired cross-protection 
has been demonstrated, Boucher and Baudin Laurencin 
(/) have reported that the two diseases are caused by 
similar or identical agents. A comparative nucleotide 
sequence analysis will be useful for determining whether 
the virus associated with SD is an additional fish alpha- 
virus and, if so. for establishing the extent of its similarity 
to SPDV 

The biological properties of SPDV are very different 
from those of other alphaviruses. For example, most 



TABLE 2 

Percentage Identities between the Structural Proteins Showing the Relationship of SPDV to Other AJphavlruse 



VEE 
ONN 
RRV 
AURA 



Note. Amino acid sequence information was obtained from Accession Nos. J02363(SIN), J02361(SFV), J04332(VEE}. J03854{WEE) X63135(EEEV) 
M20162{RRV), M20303(ONN), S78478(Aura). U73744(BFV), and AJ012631 (SPDV). 
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A SPDV UAUUGG UUUUAAAAUUUUCAAUA C<A>n 
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SFV A G-GGU A UGG ONN AU— U- U UUU -<A)n 

ONN - - 0 - G G-GG A GG MID CCA-U- G UUC - [A[n 
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FIG. 3. (A) Comparison of the 263 junction region of SPDV and 10 other alphaviruses. Dashes indicate identical nucleotides. Those nucleotides 
found only in SPDV are shown in bold. (B) Comparison of (he 19-nucleotide conserved region at the 3' terminus. Gaps have been introduced for 
alignment of this region of SPDV to other alphaviruses. Dashes indicate identical nucleotides. 



previously described alphaviruses are arthropod or in- 
sect transmitted, generally by mosquitoes (2). Given that 
most of the fish farms experiencing SPDV-related dis- 
ease are located in Northern Europe, for climatic rea- 
sons, it is extremely unlikely that mosquito-like insects 
play active roles in the transmission of this virus. How- 
ever, the role that the ectoparasitic sea lice species 
(Lepeophtheirus salmonls) may play in the transmission 
of SPDV remains to be determined. Due in part to the fact 
that they can replicate in insect as well as animal hosts, 
some of the previously described alphaviruses, for ex- 
ample SFV, have been reported to infect a very broad 
range of cultured cells, including those of mammalian, 
avian, amphibian, reptilian, and Insect origin (3, 14,4, 16). 
In addition, research performed in this laboratory has 
shown that SFV can produce infectious virus following 
inoculation of CHSE-214 cells incubated at 25°C, al- 
though productive infections did not occur when this 
virus was grown at 20°C or below (Phenix and Todd, 
unpublished results). In contrast, the cell culture growth 
of SPDV virus is restricted to a very narrow range of 
salmonidfish cell lines maintained at temperatures close 
to 15°C. 

Given the differences in the host species infected, it is 
not surprising that the levels of structural protein homol- 
ogy that exist between SPDV and other alphaviruses 
(32-33%) are less than those observed when previously 
characterized alphaviruses were compared (Table 2). 
Although, as might be expected for the nonstructural 
protein region, pairwise comparisons involving the C- 
terminal 394 amino acids of nsP4 show higher levels of 
identity (59-62%) between SPDV and other alphaviruses, 
these values are substantially less than those identities 
(75-83%) determined when other alphaviruses are com- 
pared over the same region. Our finding that the protein 
homology levels that SPDV shares with each of the other 
alphaviruses is relatively uniform (32-33%) supports the 
view that SPDV does not share a particularly close rela- 
tionship with any of the previously characterized alpha- 
viruses. Sequencing of the entire genome, which was not 
within the remit of this investigation, will be required 



before the exact phylogenetic relationship of SPDV to 
other alphaviruses can be established. 

Our sequence analysis has Identified clear molecular 
differences between SPDV and previously sequenced 
alphaviruses. At 91 nucleotides the 3'-nontranslated re- 
gion of SPDV is one of the shortest observed for any 
alphavirus. This region lacks repeat sequence elements 
that are detected in other alphaviruses and with which a 
possible role in virus RNA translation has been identified 
(28), In addition, SPDV shows the greatest level of nu- 
cleotide variation within the 19-nucleotide terminal se- 
quence. This region, which is highly conserved among 
alphaviruses, is reported to be involved in the virus RNA 
replication (13). Extensive molecular investigations will 
be required before the significance of such differences 
can be determined. 

Materials and Methods. Cells and virus. The F93125 
isolate of SPDV used in this investigation was grown in 
Chinook salmon embryo (CHSE-214) cells as previously 
described (73). For virus purification purposes, mono- 
layer cultures of CHSE-214 grown to -80% confluence in 
75-cm 2 flasks were infected with 1 ml virus to give a 
m.o.i. of —1. After 1 h adsorption, an additional 14 ml 
supplemented Eagle's minimum essential medium 
(MEM) was introduced to each flask (19). The virus- 
infected flasks were incubated at 15°C for 7 or 8 days, 
when virus-induced cytopathic effect was evident, and 
the supernatant was collected. 

Virus purification. The supernatant (typically 500 ml) 
from virus-infected cells was clarified at 1500 g for 20 
min. Polyethylene glycol (PEG) and NaCI were added to 
give final concentrations of 6 and 2.2%, respectively. 
Following overnight incubation at 4°C the PEG precipi- 
tate was collected by centrifugation for 90 min at 1500 g. 
The resultant pellet was resuspended in PBS (1-2 ml) 
and the crude virus suspension was fractionated by 
equilibrium density centrifugation using 11-ml gradients 
(20-60% w/w in TNE) of sucrose. After centrifugation for 
18 h at 50,000 g at 4 0 C, 1-ml fractions were collected 
from the bottom of the gradient Fractions containing 
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virus were identified by immunoblotting using an SPDV- 
specific mouse monoclonal antibody (Welsh, unpub- 
lished results}. 

Production of SPDV cDNA Clones. Viral RNA was ex- 
tracted from gradient-purified SPDV and virus-infected 
cells using RNA isolator (Genosys) and stored as ethanol 
precipitates. A cDNA library was first made by random 
priming with RNA extracted from gradient-purified virus. 
This library consisted of clones containing inserts (250- 
500 bp) in the vector pUC 18 (Sureclone ligation kit. Phar- 
macia). Clones were selected randomly from the library 
and following sequencing and analysis using the BLAST 
program (University of Wisconsin, Genetics Computer 
Group) were mapped to the "alphavirus" genome. The 
sequences of three such clones, N11, N38, and N50, 
which mapped to approximately 5.2, 2.7, and 1.1 kb from 
the putative 3' terminus of the "alphavirus" genome, were 
exploited to design oligonucleotide primers, which were 
used, in reverse transcriptiort-polymerase chain reac- 
tions (RT-PCR), to amplify three overlapping fragments 
encompassing the 5.2-kb region at the 3' terminus of the 
SPDV genome. The incorporation of Atari sites into the 
primers facilitated the restriction ligation of two of these 
fragments into the Atari site of the vector pBluescript 
(Stratagene) to produce clones pBS1150 and pBS5033. 
For pBSH50 first strand synthesis was carried out using 
the primer N50revNot(5'-GCATGCGGCCGCATGCGGAG- 
GACGTGGTCGCT-3 ') followed by PCR using this primer 
and the forward primer N11forNot (5'-GCATGCGGCCGC- 
CCG ACTATG GACTCAG CGG CA-3')- For pBS5038 first 
strand synthesis was carried out using the primer 
N38revNot (5'-GCATGCGGCCGCGCTGAACGCTTCGGC- 
ACGCTC-3') followed by PCR using this primer and the 
forward primer N50forNot {5'-GCATGCGGCCGCAGC- 
GACCACGTCCTCCGCAT-3'). Both PCRs were carried 
out using the Expand Long Template PCR System 
(Boehringer Mannheim) at 94°C for 30 s, 60°C for 30 s, 
68°C for 2 min. Clone pBS38R was produced using 3' 
RACE {6). The reaction was performed using a 573' 
RACE kit (Boehringer Mannheim) with some modifica- 
tions. Thus, RNA from gradient-purified virus was inde- 
pendently subjected to first strand cDNA synthesis using 
an oligo d(T) anchor primer (5'-GACCACG CGTATCGAT- 
GTCGAC(T) 16 -3'). Resultant cDNAs were amplified by 
PCR at 94°C for 30 s, 60°C for 30 s, 68°C for 1 min using 
the forward primer designed from N38, containing a A/off 
restriction site (N38forNot (5'-GCATGCGGCCGCTCA- 
GAGCGTGCCGAAGCGTTC-3') and the modified anchor 
primer (5 '-GCATGCGGCCG G G ACCACGCGTATCGATGT- 
CGA-3') containing a Atari site allowing cloning into 
pBluescript 

Sequencing ofSPDVcDNA Clones. Cycle sequencing 
was performed using the ABI PRISM dye terminator 
ready reaction kit on purified plasmid DNA following the 
manufacturer's protocol (Perkin-Elmer Cetus). Electro- 



pherograms were interpreted using the Sequence Navi- 
gator software (Perkin-Elmer Cetus). The sequencing 
strategy adopted for SPDV is shown in Fig. 1. Sequencing 
of the clones pBS1150, pBS5038, and pBS38R was ini- 
tially carried out using primers specific to the T7 and T3 
sequences within the plasmid vector and then by a 
primer walking strategy. The three constructs were se- 
quenced on both strands more than twice. Sequence 
comparisons were carried out using the GCG package, 
Wisconsin package version 8.1-UNIX, Genetics Com- 
puter Group Inc. 
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